
[responsesAPI][bugfix] serialize harmony messages#26185

Merged
yeqcharlotte merged 7 commits into vllm-project:main from qandrew:fix-serialize-harmony
Oct 7, 2025

Conversation

@qandrew
Contributor

@qandrew qandrew commented Oct 3, 2025

Purpose

Harmony messages are not being serialized properly; this was the state prior to this PR:

>>> context.parser.messages[0]
Message(author=Author(role=<Role.ASSISTANT: 'assistant'>, name=None), content=[TextContent(text='We need to respond as ChatGPT. The user says "Hello." We respond politely. Possibly ask how can help.')], channel='analysis', recipient=None, content_type=None)
>>> context.parser.messages[0].model_dump_json()
'{"author":{"role":"assistant","name":null},"content":[{}],"channel":"analysis","recipient":null,"content_type":null}'

We fix this by adding a custom serialization method in protocol.py. I filed an issue upstream: openai/harmony#78
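For context, the empty `content` objects above are a common pydantic v2 pitfall: when a field is annotated with a base class, `model_dump_json` serializes each item against the base schema and drops the subclass's fields. A minimal standalone sketch of the failure mode and the workaround idea (class names here are illustrative, not harmony's actual types):

```python
from pydantic import BaseModel

class Content(BaseModel):
    """Base content type with no fields of its own."""

class TextContent(Content):
    text: str

class Message(BaseModel):
    # The field is annotated with the base class, so pydantic serializes
    # each item using the base schema and drops the subclass's fields.
    content: list[Content]

msg = Message(content=[TextContent(text="hello")])
print(msg.model_dump_json())
# → '{"content":[{}]}'  (the text field is lost)

# Serializing each element via its concrete type keeps the fields:
print([c.model_dump() for c in msg.content])
# → [{'text': 'hello'}]
```

The custom serialization method in protocol.py follows the second approach, dumping each message by its concrete type instead of the declared base annotation.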

Test Plan

added unit tests, and ran locally successfully

server

 CUDA_VISIBLE_DEVICES=2,3 with-proxy vllm serve "/data/users/axia/checkpoints/gpt-oss-120b" -tp 2 --port 20001

client

curl http://localhost:20001/v1/responses   -H "Content-Type: application/json"   -N   -d '{
    "model": "/data/users/axia/checkpoints/gpt-oss-120b",
    "input": [
        {
            "role": "user",
            "content": "Hello."
        }
    ],
    "temperature": 0.7,
    "max_output_tokens": 256,
    "stream": true,
    "enable_response_messages": true
}'


...


event: response.completed
data: {"response":{"id":"resp_f6111e47923e423e9a21c35775af604c","created_at":1759515694,"incomplete_details":null,"instructions":null,"metadata":null,"model":"/data/users/axia/checkpoints/gpt-oss-120b","object":"response","output":[{"id":"rs_6f2b920ea35f4bbb99e6ca2cee9580de","summary":[],"type":"reasoning","content":[{"text":"We need to respond as ChatGPT, friendly greeting. Probably ask how can help.","type":"reasoning_text"}],"encrypted_content":null,"status":null},{"id":"msg_6b9222d5e23a46d4becc526cf101d085","content":[{"annotations":[],"text":"Hello! How can I assist you today?","type":"output_text","logprobs":null}],"role":"assistant","status":"completed","type":"message"}],"parallel_tool_calls":true,"temperature":0.7,"tool_choice":"auto","tools":[],"top_p":1.0,"background":false,"max_output_tokens":256,"max_tool_calls":null,"previous_response_id":null,"prompt":null,"reasoning":null,"service_tier":"auto","status":"completed","text":null,"top_logprobs":null,"truncation":"disabled","usage":{"input_tokens":67,"input_tokens_details":{"cached_tokens":0},"output_tokens":36,"output_tokens_details":{"reasoning_tokens":18,"tool_output_tokens":0},"total_tokens":103},"user":null,"input_messages":[{"author":{"role":"system","name":null},"content":[{"model_identity":"You are ChatGPT, a large language model trained by OpenAI.","reasoning_effort":"Medium","conversation_start_date":"2025-10-03","knowledge_cutoff":"2024-06","channel_config":{"valid_channels":["analysis","final"],"channel_required":true},"tools":null}],"channel":null,"recipient":null,"content_type":null},{"author":{"role":"user","name":null},"content":[{"text":"Hello."}],"channel":null,"recipient":null,"content_type":null}],"output_messages":[{"author":{"role":"assistant","name":null},"content":[{"text":"We need to respond as ChatGPT, friendly greeting. Probably ask how can help."}],"channel":"analysis","recipient":null,"content_type":null},{"author":{"role":"assistant","name":null},"content":[{"text":"Hello! How can I assist you today?"}],"channel":"final","recipient":null,"content_type":null}]},"sequence_number":38,"type":"response.completed"}

^ note that output_messages now carries the actual text content instead of empty objects (that was the previous bug).



Signed-off-by: Andrew Xia <axia@meta.com>
@qandrew qandrew changed the title [responseAPI] fix serialize harmony initial commit [responseAPI][bugfix] serialize harmony messages Oct 3, 2025
@mergify mergify bot added frontend gpt-oss Related to GPT-OSS models labels Oct 3, 2025
Signed-off-by: Andrew Xia <axia@meta.com>
Signed-off-by: Andrew Xia <axia@meta.com>
@qandrew qandrew marked this pull request as ready for review October 3, 2025 18:27
@qandrew qandrew changed the title [responseAPI][bugfix] serialize harmony messages [responsesAPI][bugfix] serialize harmony messages Oct 3, 2025
@qandrew
Contributor Author

qandrew commented Oct 3, 2025

cc @lacora , @houseroad , @yeqcharlotte , @alecsolder this is ready for review

qandrew and others added 4 commits October 3, 2025 16:47
Signed-off-by: Andrew Xia <axia@meta.com>
Signed-off-by: Andrew Xia <axia@meta.com>
Signed-off-by: Andrew Xia <axia@meta.com>
Collaborator

@yeqcharlotte yeqcharlotte left a comment


thanks for adding the tests

@github-project-automation github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Oct 7, 2025
@yeqcharlotte yeqcharlotte enabled auto-merge (squash) October 7, 2025 05:18
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 7, 2025
@yeqcharlotte yeqcharlotte merged commit 185d8ed into vllm-project:main Oct 7, 2025
49 checks passed
Comment on lines +2132 to +2141
serialized = []
for m in msgs:
    if isinstance(m, dict):
        serialized.append(m)
    elif hasattr(m, "__dict__"):
        serialized.append(m.to_dict())
    else:
        # fallback to pydantic dump
        serialized.append(m.model_dump_json())
return serialized
Copy link
Copy Markdown
Collaborator


Seems we could consolidate the majority of the code (e.g. the message-to-serialized-message mapping) into a separate function? That would allow us to:

  1. replace list.append with a simple list comprehension (better readability and better performance)
  2. write better individual unit tests.

Collaborator


Trying to refactor it a bit in #26620

Dhruvilbhatt pushed a commit to Dhruvilbhatt/vllm that referenced this pull request Oct 14, 2025
Signed-off-by: Andrew Xia <axia@meta.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>
Signed-off-by: Dhruvil Bhatt <bhattdbh@amazon.com>
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
Signed-off-by: Andrew Xia <axia@meta.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: Andrew Xia <axia@meta.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>
rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request Nov 10, 2025
Signed-off-by: Andrew Xia <axia@meta.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025
Signed-off-by: Andrew Xia <axia@meta.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>

Labels

frontend gpt-oss Related to GPT-OSS models ready ONLY add when PR is ready to merge/full CI is needed

Projects

Status: Done


4 participants